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MODEL-INDEPENDENT BOUNDS FOR ASIAN OPTIONS: A 
DYNAMIC PROGRAMMING APPROAGH 

ALEXANDER M. G. COX AND SIGRID KALLBLAD 


Abstract. We consider the problem of finding model-independent bounds on 
the price of an Asian option, when the call prices at the maturity date of 
the option are known. Our methods differ from most approaches to model- 
independent pricing in that we consider the problem as a dynamic program¬ 
ming problem, where the controlled process is the conditional distribution of 
the asset at the maturity date. By formulating the problem in this manner, we 
are able to determine the model-independent price through a PDE formula¬ 
tion. Notably, this approach does not require specific constraints on the payoff 
function (e.g. convexity), and would appear to generalise to many related 
problems. 


1. Introduction 


Since the seminal paper of Hobson 36 , there has been substantial interest in 


questions of the following form: given an asset with price (S't)tG[o,T]j a derivative 
whose payoff, Xt, depends on the path of the asset, and the prices of call options at 
maturity time T, find a static portfolio of calls, and a dynamic trading strategy in 
the asset which superhedges the derivative at time T, under (essentially) any model 
for the asset. The class of models considered are usually very large (for example, 
all models with continuous paths), and so the resulting price is usually called the 
model-independent superhedging price. 

The problem of finding the model-independent superhedging price is closely re¬ 
lated to the problem of identifying the largest model-based price: specihcally, in 
a classical setting, one would expect the prices of all options to be given as the 
expected value under some risk-neutral measure}^ and by specifying the call prices 
at time 0, the distribution of St under this risk-neutral measure is determined. It 
is therefore natural to conjecture that the mo del-independent superhedging price is 
equal to supQ E*® [Xt] , where the supremum is taken over all probability measures 
Q such that iSt)t^io,T] is a martingale, and St has the distribution determined 
by the call options. Recently a number of papers, starting with Beiglbock, Henry- 
Labordere, and Penkner [10] in discrete time, and followed up by Dolinsky and 
Soner 25 in continuous time (see also [^[^|9 [^1^1^), have made this result 
explicit under a variety of technical conditions. Note that in this formulation it 
is very natural to consider the supremum over the set of probability measure as a 
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primal problem, and the infimum over the class of super-hedging strategies as the 
corresponding dual problem. 

An alternative approach to these problems, following Hobson [^, is to use a 
time-change argument to reformulate the primal problem in terms of an optimisa¬ 
tion over solutions to the Skorokhod embedding problem (SEP): that is, to argue 
that, up to an unknown time change rt, the martingale St = is a time change 
of a Brownian motion. For a number of important quantities (maximum, quadratic 
variation, local time, ...) the values of these quantities for the asset price up to 
time T and for the Brownian motion up to the (stopping time) tt agree. It can 
often then be argued that the choice of a model for St with given law, and the 
choice of a stopping time tt are equivalent provided Brj. has the required distri¬ 
bution (and satisfies an integrability constraint). The latter problem is well known 
as the Skorokhod embedding problem. A common approach to solving the model- 
independent superhedging problem is then to consider the corresponding Skorokhod 
embedding problem. If an optimal solution to this problem can be found, then it is 
often possible to guess the correct solution to the corresponding dual problem, and 
interpret this in terms of a superhedging strategy. This approach has been used in 
e.g. 14 ,16 17 18 19 20 34 ,37 ,38 42 ; see also the survey article of Hobson 35 


Of note in all of the known optimal solutions to the SEP is that some under¬ 
lying structure is required on the form of the option payoff: for example, if we 
write Mt := supj(<j B^ for the maximum process, then the optimal constructions 
are known to maximise E[F(Mt-)] over solutions to the SEP, provided that F is 
monotonic. To the best of our knowledge, the optimal construction when F is not 
monotonic is not known. Similarly, in the case of variance options or the local 
time, the function F must be concave/convex in order to have a known optimal 
solution. In Beiglbdck, Cox, and Huesmann [^, this behaviour was explained in 
terms of a natural convexity property which holds when a path-swapping operation 
is performed. It follows from this operation that many constructions of solutions to 
the SEP are optimal when the payoff to be optimised has such a convexity property. 
However without the corresponding convexity, a ‘nice’ description of the optimal 
solution seems impossible. One of the key strengths of the results described in this 
paper is that our methods are not constrained by such a convexity assumption on 
the payoff, and therefore will work for very general payoffs. 

The main results in this paper concern the case where the option described above 
is an Asian option, that is, Xt = F{At), where At = fg Sudu (we omit the usual 
scaling factor, j for notational ease), and we consider the primal version of the 
problem, that is, we look to maximise E [F(At)] over all price processes (Su)uelo,T] 
which are martingales, and which satisfy a constraint on the terminal law, St ~ fJ-- 
Notably, the Asian example already falls outside the case of payoffs which can 
easily be handled by SEP methods, since the whole time-change (T„)„g[o_T]i 
not just the final time, tt, is already important in determining the value of At- 
However, in the case where the function F is convex, the optimal model is still easily 
determined: essentially, the asset will jump to its terminal distribution immediately, 
and the manner in which this is done (the ‘embedding component’) turns out to 
be irrelevant. This result was first given in Stebegg 45 , which, to the best of our 


knowledge, is the first paper to characterise optimality in a setting where the SEP 
approach fails, or more generally to consider a problem of this form in continuous 
time without using the SEP approach. The standing assumptions in [45| are slightly 
different to ours — essentially, 45 allows a slightly more general setup (general 
starting measures, and discrete and continuously monitored payoffs are included) 
at the cost of considering only convex payoff functions (see also Section [4T| ) . At 
a heuristic level, this restriction to convex functions in |45] appears comparable to 
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the convexity constraint described above for the SEP in determining the ‘simple’ 
set of optimisers. We also observe that there is a long history of considering model- 
independent bounds for the prices of Asian options (e.g. illllllllllllllT]), 
although we note that, in contrast to the case considered in this paper, existing 
results tend to consider discretely monitored Asian options, often when call options 
on the underlying are traded at some or all intermediate maturities. 

The novelty of our approach relates to the manner in which we formulate the 
problem as a dynamic programming problem. In particular, we include the condi¬ 
tional law of the final value of the asset price in the formulation of our problem. 
The condition that the process is a martingale with this conditional law is then for¬ 
mulated in terms of the behaviour of the conditional law. Specifically, we require 
the conditional law to be a measure-valued martingale, by which we mean that 
is required to be a martingale for any (Borel) set A. We will show in 
Sectionl^that this condition is equivalent to the original formulation. In particular, 
by requiring = /^ and requiring to be singular, we enforce the condition that 
the terminal law oi St = f x^tidx) is /i. The concept of a measure-valued martin¬ 
gale is classical, (see e.g. Dawson [^, Horowitz 39 ; in this literature, the rather 


confusing terminology ‘martingale measure’ is also common), and has appeared in 
the context of the SEP in Eldan . A key result for our purposes is that we are 
able to show that our value function is continuous in where the space of mea¬ 
sures is equipped with the Wasserstein topology. This allows us to approximate 
^ by atomic measures, which enables us to reduce the whole problem to a finite¬ 
dimensional problem, at which point classical methods can be used ( Section]^. We 
note that, in this discrete formulation, our problem could be compared to (a special 
case of) the problems considered in Bouchard and Nutz 
Zitkovic 


29 


46 


12 , El Karoui and Tan 


, although we prove our results via more direct, classical methods. 
We also remark that Galichon, Henry-Labordere, and Touzi [32| have also previ¬ 
ously used a stochastic control approach to solve a similar problem, but in a rather 
different manner to the approach of this paper. In Section we are able to use 
these results to provide concrete solutions to certain problems. 

We believe that the methods and ideas we describe in this paper can be ap¬ 
plied far beyond the case of Asian options. However, the Asian option setting does 
provide us with some useful structure which we are able to exploit in the construc¬ 
tion and formulation of our problem. In particular, it is easy to show that ‘small’ 
changes in the conditional terminal law result in small changes in the value function 
for the problem, the increase in the average, dAt, is easy to write in terms of the 
current conditional law, and also our underlying problem is not strongly affected 
by jumps in the process: particularly, the value function for the problem where the 
path is assumed to be continuous, and the problem where the path is assumed to 
be cadlag are identical (although optimisers may exist in the cadlag formulation, 
and not in the continuous formulation). In Section]^ we discuss further extensions. 


2. Problem formulation using measure-valued martingales 

Consider the following problem: we have an asset (*S't)tG[o,T] in a market with 
a riskless bank account and a time-horizon T, and we wish to find a model- 
independent super-hedge of an option which pays the holder F{At), where ^At = 
/(f St dt is the running averag^ We will consider the problem where the law of 


^We use the slightly unconventional notation At = f(f St dt to avoid an unnecessary number 
of terms of the form T in all our calculations; it is clear that this is just a scaling factor and can 
be removed. 
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the underlying asset at maturity, St, is given at time 0, and we consider the pri¬ 
mal optimisation problem: that is, to find the law of the process which maximises 
E[_F(A'r)] subject to St ^ Here, we consider the case where the interest rate 
p = 0, although the extension to constant interest rates is straightforward. 

Our basic approach is to consider the problem as a dynamic programming prob¬ 
lem where the current state includes the conditional distribution of the process at 
maturity. Specifically, we assume St G R+, and with A^(IR_|_) the set of Borel 
measures on M_|_, we write 

(2.1) := {n G A1(M+) : p(ffi+) = 1, / \x\ p(dx) < oo}. 


Our aim is to set the problem up as a dynamic programming problem. We suppose 
that the problem evolves on an artificial time horizon, r > 0, on which a measure¬ 
valued process (^r)r>o,^r G evolves. We let {Tr)r>o be an increasing process in 
[0,T]. Our interpretation of this quantity is that Tj. represents the ‘real’ time at 
the artificial time r. Roughly, the slower Tr increases, the higher ‘volatility’ we see 
in the real-time scale. We set the problem up in this way, since we wish to allow 
a substantial change in the r time-scale to happen instantaneously in real time, 
which may correspond to jumps in the asset price. However, we wish to maintain a 
‘continuous’ evolution of the measure-valued process over its natural time-scale (we 
do not wish to deal with jumps in the measure-valued process). The choice of the 
increasing processes (T^.) will form part of the control of the problem — specifically, 
we optimise over G [0,1] and define 

(2.2) Tr= [ A^ds. 

Jo 

The second part of the control will be the choice of the measure-valued process 

This process will determine the conditional distribution of the asset {St). Specif¬ 
ically, the initial value is where /i is the terminal law of the asset at time 

0, and over time we suppose that (^r) evolves in a manner that ensures that {St) 
remains a martingale. 

Definition 2.1. We say that an adapted process (^r)r>o with G 7^^ is a measure¬ 
valued martingale if, for any / G Cb{M.+ ), ^.(/) := / f{x)C{dx) is a martingale. 


Note trivially that, if / G C'6(K+), then ^r{f) is bounded for each t, and hence the 
martingale ^. (/) is uniformly integrable, with well defined limit ^oo(/) (in particular, 
^oo is a measure; see 39 Proposition 2.1]). 

Remark 2.2. An adapted process {^r) with G is a measure-valued martingale 
if and only if (A) is a martingale for any A G H(]R). Indeed, the indicator function 
over an interval of K may be approximated by continuous functions, and an appli¬ 
cation of the monotone class theorem yields that the claim holds for any A G B{M.) 
(see Lemma [2.12 for a similar argument). Conversely, any / G C'b(K+) may be 
approximated from below by simple functions. In fact, by the same argument, we 
see that ^. (/) is a martingale for any (non-negative) measurable function. 


Remark 2.3. Our underlying probability spaces will generally be assumed to sat¬ 
isfy the usual conditions. Under this assumption, of course, for every / G Cb{M.+ ), 
the martingale C{f) has a cadlag version. More pertinently, we can choose a ver¬ 
sion of {^r) such that C{f) is cadlag for every bounded Borel function /, see 
Theorem 2.5]. In what follows, we will assume that we always take this version of 
{&)■ 


We will think of measure-valued martingales as processes, evolving in time. We 
emphasise that the support of the measure-valued martingale can only ever de¬ 
crease: if ^ro(^) = 0 then ^r(A) = 0 for all r > rg. In the particularly nice case 
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that ^ro is an atomic measure, then for all r > tq, the measure will also be atomic, 
and supported on the same set of points. In particular, the spatial distribution of 
such a measure will not change, only the weights attributed to each point. Since 
the weight associated to each point is a martingale and constrained to lie in the 
interval [0,1], it follows that in the limit as r —)■ oo, the weight assigned to each 
point must converge to a limit; commonly, this limit will be assumed to be either 
0 or 1, and this motivates the following definitions. Consider the set of singular 
measures on K+, = {/i e AI(K-i-) : fi = 5y,y € K-i-}, then; 

Definition 2.4. We say that a measure-valued martingale {^r) is terminating if 
Ci- —t ^oo G P® a.s. as r —>■ oo, where the convergence is in the sense of weak 
convergence of measures. It is finitely terminating if := inf{r > 0 : S 7^®} is 

almost surely finite. 


Lemma 2.5. Suppose (fr) is a terminating measure-valued martingale withf^Q = y,. 
Then X. = f xf.(dx) is a non-negative UI martingale with Xoo ~ y- 


Proof. The martingale property follows from Remark |2.2| In particular, for y G K-i-, 
we have that 


E[(j/-Xoo)-h] =E 



a;)+^oo(da;) 



x)+ /r(da;). 


Since E [{y — ^oo)-i-] characterises the law of Xaa uniquely, Xaa y. As Xq = 
f X y{dx) < oo, it also follows that E [|Ar|] = E [Xr] < oo. Finally, we observe that 
Xr = E [Aoo|J>] and X^o ^ y GV^ imply X is a UI martingale. □ 


Corollary 2.6. If (fr) is a terminating measure-valued martingale with fo = y, 
then for every 1-Lipschitz function f, X^ := f.{f) = J f{x)f.{dx) is a uniformly 
integrable martingale with Xq = J f dy and ~ Ht)- 

We also wish to discuss the continuity of the process (^r)- In order to do this, 
we make the following definition: 


Definition 2.7. We say that a measure-valued martingale is continuous if, for any 
1-Lipschitz function /, X^ = J f{x)f.{dx) is almost surely continuous. 

It immediately follows that X. = J x f. (dx) is a continuous process whenever 
(fr) is continuous. This is also equivalent to requiring (almost sure) continuity of 
r i-A- in the topology of Wi, the first Wasserstein metric, by the duality of the 
Wasserstein distance Theorem 6.1.1]. 

Having introduced these concepts, we will take the second control in our problem 
to be the choice of a process (^r), subject to the constraint that is a terminating, 
continuous measure-valued martingale with = M- 

Observe that, once we have chosen a process (fr), the ‘asset price’ at time Tr 
is given by J xfr(dx). Since the process Tr is non-decreasing, there exists a right- 
continuous inverse, T^^ = inf{r > 0 : Tj. > t}, and introduce T^ = inf{r > 
0 : Tr A T > t}; moreover, there can be only countably many jumps in Tf . We 
therefore define the cadlag process 


(2.3) St = J (da;), t<T, 

and note that St = J a:^oo(da;). The average process is then given by 

(2.4) At = f Ssds= f f a; (da;) ds. 

Jo Jo J ’ 

Then the main problem we wish to solve is the following: 
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Problem 2.8 (Basic optimisation problem). Given an integrable probability mea¬ 
sure pL on K+ and a function F : K_|_ —>■ K_|_, we want to find a probability space 
(n, ('Ht)tg[o T]j IP) satisfying the usual conditions, and a cddldg UI martingale 
iSt) tG[o,T] on this space with St pi which maximises E[i^(A 7 ’)] over the class of 
such probability spaces and processes. 


Remark 2.9. Since we do not require Hq to be trivial, Sq need not be a constant. 
However, for the Asian option, it holds for any probability space and cadlag mar¬ 


tingale St as given in Problem 2.8 that one may construct a sequence of cadlag 


martingales (S'”) such that Sq = sq G K, Slf ~ p., and 


lim E [F(AJ)] = E [F{At)] . 


(2.5) 


Hence, the value of Problem |2.8| remains the same under the additional assumption 
that Sq = Sq, and for any optimiser to the former problem an approximately optimal 
sequence may be constructed for the latter; we refer to Lemma 5.1 in Stebegg [45] 
for a precise argument (see also Assumption 3.9 in Guo, Tan, and Touzi and 

the proof of Lemma 4.1 in Dolinsky and Soner 24 for related arguments). We 


argue in the proof of Lemma [2. 11| below that the value of Problem |2.8| remains the 
same if restricting to martingales which are piecewise constant over arbitrary but 
finite partitions. Hence, a similar argument yields that the value of the problem 
also remains the same if we restrict to continuous martingales. 

To formalise this remark, and since we generally wish to work with probability 
spaces satisfying the usual conditions, we extend our framework slightly: given a 
complete probability space with a right-continuous filtration we can always 

extend the filtration to (—e, oo), for some e > 0, by taking Qt to be the (completion 
of the) trivial cr-algebra for t < 0. Similarly, a cadlag process Zt on [0, oo) can be 
extended to a cadlag process on (—e, oo) by setting Zt to be some constant value 
for t < 0. Since this is constant we may write Zq_ for this value without confusion. 
Similarly, to avoid the excessive use of e’s, we write {Gt)tGlo-,oo) to denote a filtra¬ 
tion extended in this manner. All other terminology (e.g. martingales) are then to 
be understood in the obvious way. 

Our first claim is that Problem |2.8| is equivalent to the following formulation: 

Problem 2.10 (Measure-valued martingale formulation). Given an integrable prob¬ 
ability measure p on M_|_ and a function F : M_|_ —>■ IR_|_, we want to find a probability 
space (H,^, (0r-)r-G[o-,oo);lP) satisfying the usual conditions, a progressively measur¬ 
able process Xr € [0,1], such that As ds —>■ oo a.s. as r ^ oo, and a finitely 
terminating measure-valued {Gr)rGlo-,oo)~™ortingale (^r)rG[o-,oo] with ^o- = P and 
f x^r(dx) continuous a.s., which maximises V,[F{At)] with At given by (2.4|. 


Lemma 2.11. Problems \2.8\ and \2.10{ are eguivalent, in the sense that the values 
coincide and if there exists an optimiser in Problem \2.8l then we can construct an 
optimiser for Problem 2.1C\ and vice-versa; if the supremum for the problem can 
only he approximated, then equivalent approximating sequences can be found. 

Moreover, if F is continuous, then the value of the problem remains the same if 
we restrict Problem \2.1C\ to probability spaces and processes such that the filtration 
Gr is the usual augmentation of the natural filtration of a {Gr)r>o-Brownian motion 
and {£,r)r>o is continuous in the sense of Definitions 2.4 and 2.1. 


As a consequence, if the restricted measure-valued martingale problem admits 
a solution, then a corresponding optimiser may be constructed also for the Basic 
optimisation problem. Before proving this result, we give an auxiliary lemma. 
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Lemma 2.12. Suppose (^r)rG[o,oo] is a martingale on (^/r)rG[o,oo] > P) such 

that E[|Xoo|] < oo. Then there exists a terminating measure-valued martingale, 
(^r)rG[ 0 ,oo]> such that Xj. = f x^r(dx), Q.s. for all r G [0, oo]. 

Proof. Define the ^oo-measurable random measure ^oo(da;) := 5xoo(da;). Then, 
Coo GV^, a.s. Further, let ^ be a countable Boolean algebra generating B{R) and 
define the ^^-measurable set function Cr by 

(2.6) C.(^) :=E[Coo(A)|0r], As A 


Since Coo is countably additive a.s., so is Cr 

,iT^h 


Indeed, for S .4, n S N, such that 


yjAn G A and Ai n Aj = 


OO 


oo 


C.(UA) =E[Coo(UA„)|0,] = > E[Coo(4„)|0,] = > Cr(^n). 


n—1 


n—1 


Since Cr is also finite, it follows that (2.6) uniquely defines a ^^-measurable measure 
on up to a null set, on which we arbitrarily take Ct = <5o. Next, let O := {A G 

;B(M) : Cr(4) is a martingale on [0,oo]}. Since, for any r G [0,oo], Cr is a measure 
and thus continuous from below, it follows that O is a monotone class. Indeed, for 
An G O, n gN, with An C An+l C ..., 

E[Coo(U4„)|^r] = lim E[Coo(4„)|0r] = lim Cr(^n) = Cr(u4„). 
n—>-oo n—^oo 

Since .4 C O, we have by the monotone class theorem that Cr(^) is a martingale 
for all A G ;B(]R). Since E]/ |a;| Coo(da:)] = E[|Xoo|] < oo, this yields in particular 
that Cr € 'P^, for r G [0,oo]. According to Remark 2.2 (Cr) is thus a measure¬ 


valued martingale. It is terminating by definition. It therefore follows directly from 
that f x^r(dx) = Xr, a.s. for r G [0,oo]. 


Lemma 


2.5 


□ 


39 Theorem 1.3], on 


Remark 2.13. The above result can be partially found in 
taking (Cr)rG[o,oo] as the optional projection of the random measure Sx, 

Proof of Lemma \2.11\ We show that every candidate solution to Problem |2.8| gives 
rise to a candidate solution to Problem |2.10[ and vice-versa. The claim about 
optimisers follows. 

We first suppose that we have a solution to Problem 2.8 By Monroe 43 Theo¬ 
rem 11], there exists a probability space (11, G', (I^()sg[o,oo)j E)j a (^()-Brownian mo¬ 
tion (Ws) with Wo = f xfi(dx), and a right-continuous (0')-time change (Tt)t£[o,T], 
such that (St) and (ITVJ are equal in law, tt is almost surely finite, and Watt is a 
UI martingale. We then define C(i to be the law of Wrj. conditional on G),. That is, 
we apply Lemma [2.12 to the process Watt to obtain a terminating measure-valued 
martingale (Cs)sg[o,oo ]5 such that J x^(,(dx) = Wsattj a.s. Note that the properties 
of Cs are preserved by defining Co_ = /x, and that f xC((da;) must be continuous. 

We now need to construct a measurable process At giving rise to a time-change 


Tt via (2.2) such that the process (St) given by ( |2.3| ) is the required process. Note 
that by construction, (St)t>o and (/ x C(-j (d^i))= (WtJ^q are equal in law, and 
therefore they both give rise to the same value of E [T(At)]- We will now modify 
the time-change and deduce that this gives rise to the correct process. Specifically, 
we recall that tt is finite a.s., let 


(2.7) 


T-i = 


I T 


rr + l + ^ 


t < T 
t > T 


and, in turn, define := sup{t > 0 : Tf~ < r}. 
that Tj“^ is strictly increasing, with — T~^ 
non-decreasing and 4-Lipschitz. In particular, Ttt-i-i-i-t = T -|- r for r > 0 so that 


From (2.7) we immediately see 
so that T 


> for t > s 


IS 
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Tf. ^ given by (2.7) is indeed the right-continuous inverse of T^.. Further, with Rr — 


Tr.f\T 


, rp and ■= T G [0,oo], it follows that Rp-i{uj) = Ttioj), t < T, and, 
thus, {St)t<T and ^/ a;^.j,-i(da;)^ are equal in law. Indeed, G for r > tt- 


Therefore let {Qr) be the (right-continuous) time-changed filtration given by Qr = 
Q'ji , r G [0,c»). Then is a finitely terminating measure-valued (t/j.)-martingale. 
Further, £ Q'^ = I/' , i. = Gj-^ and, thus, G Gr- Recalling the properties 

of Tr, we deduce that there exists a process G [0,1] which is Gr measurable 
and such that = T /J" As ds. Hence (possibly by taking a modification), A^ can 
be assumed to be progressively measurable, and it is immediate that t oo as 
r —>■ oo. 

To see the converse, suppose we are given a solution to Problem |2.10| From 
Lemma 


2.5 


it follows immediately that S. = J x ^p-i,,{dx) is the required process. 
It remains to argue the second part of the lemma. Indeed, in general, the time- 
change granted by Monroe [43| may not necessarily be measurable with respect to 
the Brownian filtration. However, for any probability space (H, "H, ('Ht)tg[o,T]iP) 
and cadlag martingale {St)tG[o,T]j may define a sequence (iS'")jg[o_T] by 


S? = Su 


n > 1. 


'[nt/T]T/n 

Then the (S'”) are still martingales with St ~ l^- Further, since F is continu¬ 
ous, F{Ap) converges a.s. to F{At), and an application of Fatou’s Lemma gives 
that E[F(H 7 ’)] < liminf„_,.oo E[F"(Hy)]. In consequence, the value of Problem 2.8 


remains the same if restricting to martingales which are piecewise constant over 
arbitrary but finite partitions. Since any discrete martingale may be embedded in 
a Brownian motion with stopping times measurable with respect to the Brownian 



suiting measure-valued martingale is finitely terminating, and that the first time 
the integral of As reaches T is finite also follow immediately from this embedding 
procedure. □ 

Remark 2.14. We note that the embedding of a process (<S't)tg[o,i] iiito the pair 
of a continuous measure-valued martingale (^r)r-G[o,oo] and time-change (Ar)r.e[o.oo]) 
is not unique. In particular, choosing := 1 — -I- ^ (c.f. (2.7)) renders 


Tp < 2 a.s. and the problem might be viewed as evolving on the finite time-scale 


r G [0, 2]. In Lemma 3.4 below, we will consider yet another scaling which yields a 
specific relation between the evolution of the ^ and the A. 

Remark 2.15. We observe in fact that, from the proof of the lemma, if A^ = 1 
for r G [u, u), for some interval [u, u), then for all r G [u,v). In particular, 

A = 1 corresponds to a constant ^ and, thus, (St) is constant on t G [Tu,Ty). 


3. The Dynamic Programming Problem 

3.1. Problem formulation and continuity. We want to write the optimisation 
problem as a ‘Markovian’ optimisation problem: we suppose that at time r, we 
have ‘real’ time = t, current law ^ G running average At^ = a, and we 
wish to find: 

(3.1) U{r,t,^,a) = supE [F’(AT)|r^ =t,^r = ^,At^ = a], 

where the supremum is taken over all time-change determining processes (A„)uG[r,oo) 
and measure-valued martingales (Cu)iiG[r,oo) satisfying the conditions of Problem 
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2.10. By Lemma 2.11 it follows that 1/(0, 0,/i,0) will be the value of the Asian 
option under the optimal model. At this stage, we directly define the function in 
to equal the value of Problem |2.10| when the law to be embedded is given by 
the horizon by T — t, and the payoff function by F{a + ■). Then we have: 


Lemma 3.1. Suppose F is a non-negative, Lipschitz function. The function U : 
IR+ X [0,T] X X IR_|_ —>■ K zs continuous (here the topology on is the topology 
derived from the Wasserstein-1 metric), and independent ofr. 


Proof We begin by observing that continuity in all the variables except f follows 
immediately from the continuity properties of F: any small change in a will result 
in a direct shift in the final value of At, while small changes in t can be handled 
by computing the average of the same model over the modified time-horizon. In 
addition, the independence of the problem on the value of the ‘measure-valued’ 
time-scale, r, is immediate. 

We consider finally the continuity in the measure, f. Consider a given probability 
space (0,0, (5s)s>r,IP) and a measure-valued martingale {fs)s>r- Recall that Wi 
is the Wasserstein-1 metric space, and write for the metric on this space. Let 
f' We will first show that, if dwi(Cr,^0 < then there is a measure-valued 

martingale {f's)s>r such that and E [|/ xfs{dLx) — f a:^((da;)| \Qr] < e 

for all s G [r, oo). Recall that dw^{^r,f') < £ implies that there exists a trans¬ 
port plan, r G AI(K-|- x K+) such that ^'(dj/) = r(M-|_,di/), ^r(da;) = r(da;,K-|_) 
and / / |a; — 2/1 r(dx, dy) < e. First, by disintegration (e.g. Theorem 5.3.1]) 
there exists a Borel family of probability measures, m{x, dy) such that r(da:, dy) = 
fr{dx)m{x,dy). 

Now define the process 

■= j 6(da;) m{x, di/), s > r. 

Then G and = f. Further, for any A G B{R), since m{-,A) is measurable. 




m{x,A)f^u{dx)\Qs 


= J m(x, A)^^(da;) = C{A), 


s < u, 


and, thus, s > r, is a measure-valued martingale. Next, note that 
Jxfsidx)- J xf,({dx) = J J{x - y)^s{dx)m{x,dy) 


< 


Hence 


E 


a;^s(da;) - J xf({dx) 


J J\x-y\^s{dx)m{x,dy). 


By the definition of the metric on Wi, since dw^ fO < £, we can find a transport 
plan F with the desired marginals and f J \x — y\ r(da;,d 2 /) < £■ Fix some process 
(As)s>r, and write Af^ for the average process corresponding to the measure-valued 
process f and the time-change process A, conditional on J>. Recalling (2.41 we have 

E 



\Qr 

= E 

/ / X (dx) ds — / / x^!L-i(da:) ds 

Gr 




JTr J ^ Jtt j 



< E 


J J x£,j.-i{dx) - J xf'.j^-i{dx) 


ds 


<£{T-Tr). 
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Now fix e' > 0 and consider G such that < ^V(27"C)) where 

C is the Lipschitz constant of F. Then there exists {^8,^s)s>r, such that ^ 

and U{r,t,^,a) <E F Gr + s'/2. Using the estimate above, and by the 

Lipschitz property of F, we can moreover find such that and 


1 

1_1 


Gr 

< e' j2. It follows that 

r,t,^,a) < E 


Gr 

-h e'/2 < E 


Gr 


F s' F U(r,t, , a) + s'. 


By symmetry, |t/(r, t, a) —C/(r, f, a)| < e'. Finally, we note that joint continuity 
follows as a simple adaptation of this argument combined with the arguments for 
the other parameters. □ 


Since the function U{r,t,^,a) is independent of the parameter r, we will often 
write C/(t, o) where there is no confusion. 


Remark 3.2. Continuity of the primal problem as a function of ^ was proven by 
alternative methods in Dolinsky and Soner Theorem 4.1]. As demonstrated in 
Guo, Tan, and Touzi 33 Proposition 4.3], upper semi-continuity can be proven by 
yet an alternative method. We now recall their argument in the present context. To 
this end, consider the space of all cadlag paths on [0,1] and let the filtration be the 
one generated by the canonical process (St). Problem 2.8 can then be formulated as 
maximizing E[F'(A 7 ’)] over martingale measures satisfying the constraint St /i- 
Given a sequence of probability measures (/i„) on IR_|_ converging in dwi to /r, let 
(P„) be a sequence of martingale measures such that St l^n, and 


(3.2) lim E’^" [F"(A'r)] = limsup 17 (0, 0). 

n—>-oo 

According to Jakubowski |^, there exists a sub-sequence (Prafc)fe>i which is weakly 
convergent with respect to the so-called S'-topology on the set of cadlag paths. Let 
Po be the limiting measure. According to Guo, Tan, and Touzi [^, Pg is then a 
martingale measure and St IJ-- Since the mapping uj i—?> At{uj) is S'-continuous 
(c.f. Corollary 2.11 in [^) it follows that 

t/(0,/r,0) > E’^«[F(At)] > lim E'^-»= [F(At)] , 


which combined with (3.2) yields the upper semi-continuity. 


3.2. Reduction to a finite dimensional problem. Our aim now is to provide 
a more concrete description of the function U. However, because the function U is 
continuous in we can restrict ourselves to a nicer class of problems: specifically, we 
can approximate our object of primary interest, U{t, a) by a sequence U{t, , a), 
where can be chosen to have nice properties. For our purposes, a natural 
simplifying assumption is to assume that the measures are atomic measures. In 
this case, as we shall see, the problem becomes much more tractable via classical 
methods. As a consequence of this reduction, we will be able to deduce that a 
Dynamic Programming Principle holds by standard results from the literature. 
However the more theoretical question of whether a DPP holds for the original 
formulation is proved in the appendix; this result will not be used elsewhere in the 
paper. 

To do this, we let Xjv = {xo,Xi,... ^xn}, where 0 < xq < a:i < • • • < xn, and 
write V^(Xn) = (X^v) and V^{Xn) = Observe that if (^r) is a 

terminating measure-valued martingale and then € 7^^(Xjv) a.s. for 

all r > 0 and ^oo = for some Xi G X^r. Further, write a C {0, 1,..., fV}, X^ = 
{xi : i G a}, and 'P^(Xq,),7^®(Xq) etc. as above. In particular, Xjv = 

We then have the following characterisation: 
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Lemma 3.3. Suppose p G 7^^(Xjv). Then {^r) is a measure-valued martingale 
with ^0 = p if and only if := ^r-({a^n}) is a non-negative martingale for each n 

and Moreover, {^r) is terminating if and only if = 0 for all but 

one n G {0,1,2,..., TV}, almost surely, and (^r) is continuous if and only if is 
continuous for each n. 

It is clear that there are similar statements where is replaced by V^lfKa). 

Then we consider the further consequence of Lemma |2.11[ by the Martingale 
Representation Theorem, working on the probability space granted by Lemma [2.11[ 
we can assume that the dynamics of (^r) ER'e given by a controlled Brownian motion, 
in a recursive formulation. For fixed > 1, we suppose that (^r) solves the SDE 

(3.3) dC = W^ldWr, 

for [Wr) a standard Brownian motion, and where w,. = {w ^,..., G K^, = 

— J2n=i € (0,1} implies w” = 0, n G (0, ...,TV}, — that is, as soon as 

one of the atoms disappears, it can never be resurrected. 

Next, we show that {^r) and (A^) may be chosen so that a specific relation holds 
between and A^. 


Lemma 3.4. Let p. G and consider a martingale (S't)tg[o,T] represented via 

(2.3) by processes (Ar, ^r)rG[o,oo) given on a probability space (A2, 5, (l/r)r-G[o,oo)j P)? 
such that Xr G [0,1] is a progressively measurable process and {^r) is a measure¬ 
valued martingale with = fi. Suppose further that (Qr) is the natural filtration 
of a Brownian motion (Wr), infjr > 0 : /p A^ds = Tj < oo a.s., and {^r) is 
continuous and finitely terminating. Then, w.l.o.g., we may assume that 


(3.4) 


+ A„ — 1 — 1 


{5,»e’PT-‘-{T„=T}- 


That is, we can always choose a multiple (17,1/, (^r)) P) (Ar, Cr)) which represents 
[St) via (2.3), and which satisfies the above properties as well as (3.4). 


Proof. Suppose (17, Q, (l/j.),P, {Xr,£,r)) satisfy the assumptions of the lemma (apart 
from (3.4)). We aim to construct a time-change such that the time-changed fil¬ 
tration (Qu) and time-changed processes (Xu, ^u)u>o satisfy the assertions. To this 


end, recall that T^ is given by ( 2 . 2 ) and let r := infjr : G and = T}; 

since is finitely terminating, r is finite a.s. Let : 17 x [0, oo) —)• be given by 

(j){u) = /p rjsds for some positive, adapted process rju such that (j){oo) > r. Then 
is continuous and increasing in u, and its inverse (j)~^ is well-defined and continuous 
on [0, r]. We define 


(3.5) 


S-Ild Tu ■ - 


and let (0u)ue[o,oo) the time-changed filtration with — G(p{u)- Note that is 




a continuous measure-valued (^ii)-martingale. Moreover, ( |3.5[ ) implies that — 
(/)“^(T(“^), t < T (recall that Trp^ = oo). Hence, St is given by (2.3) defined with 
respect to and Tu. It remains to argue that rju can be chosen such that 


(3.6) 


-)- A^ — I . 


First, note that </> ^(r) = infju : G 7^® and T„ = Tj. By the Martingale 

Representation Theorem, we know that is given by (3.3) for some process (w^) G 
Since there is a (l/u)-Brownian motion (i?„) such that dW 0 („) = rjudBu, it 
follows from (3.5) that 


(3.7) 


d^u — '^ip(u)VudBu and Xu — Vu^rtM’ 














12 


ALEXANDER M. G. COX AND SIGRID KALLBLAD 


Let Wu = Then (3.6) holds for m < ^ if 1 satisfies 

1 


rii = 


< <!> (r). 


,|w0(„)|P + A0(„) 

We therefore proceed by defining (f>~^ via 

d(/)-i(r) := (llw^lp + Xr) dr, 

It follows from the construction of (Cr)rG[o,oo) and (Ar)re[o,oo)) that for r < t, 
IlwrII = 0 if and only if A,- = 1 (c.f. Remark 2.15[ and note that ||w|| = 0, if 


r < T. 


and only if, ||(it;°,w)|| =0). In consequence, ^ ^ is well-defined, continuous and 
strictly increasing on [0,r]. In particular, 


^ M = [ (||wr|p + Ar) dr = T+ f ||wr||^dr, 

Jo Jo 

and we observe that, as the quadratic variation process of a bounded martingale (in 
M^), /J" ||wr|p dr is almost surely finite. It follows that and r]u are well-defined, 
for u < (j)~^ (t), and that ipu and (j)~^ are adapted with respect to the respective 


filtrations (Gu) and (Qr)- In consequence, and Xu are well-defined via (3.5) for 


u < 4> and, according to (3.7), A„ = rjl^X^^u) G [0)1]) and can therefore be 


assumed to be progressively measurable (possibly after taking a modification). For 
u > 0“^(r), we take = 0 = A„. □ 

Remark 3.5. When embedding piecewise constant martingales as was done in the 
second part of the proof of Lemma 2.11 it follows that A^ G {0,1}. However, the 


solution to Problem 2.10| (which in general is not unique), and thus to the basic 
optimisation problem, may be attained for more general processes A„ S [0,1]; c.f. 
the non-convex example considered in Section [4.2[ Hence, we do not further restrict 
the set of A’s even though the value of the problem would remain unaffected. 

Given the above dynamics of the stochastic factors, we note that U{r,t,^,a) in 


(3.1) is now well-defined as the value-function corresponding to a class of dynamic 


control problems. In particular, w.l.o.g., we may optimise over controlled processes 
defined on a fixed reference probability space; see e.g. 30 . The following result 


is now an immediate consequence of the lemma above. Let Au^ = {(wu)„>„(, 

/I Ar \ - I I I I . . 

Wu prog. meas.,w^ 
control^ 


= e and 


<1} the set of admissible 


Corollary 3.6. For eachw G Au^, define {Xu)u>uo by and (Cu)«>«o by ( |^ 

with w*. Then, for /i G Problem 2.10 in its restricted form 

(c.f. Lemma 2.11) is equivalent to finding a process w G Aq such that fjf G {0,1} 
implies wf = 0, s > u, for n G {0,..., A^}, and such that w maximises E[F(Ht)] 
over the class of such processes where 

(3.8) dHT„ = {xo,...,xn) ■ iC^...Au) ^uT du. 


Moreover, for all e > 0, there exists u* = u*{s) > 0 such that, for any fj.,t,a, 

^n) ~ M) ^UO ~ ^) ^ ^ 


inf P(e„„+„. e 
weA„„ 


Proof. The only part of the first half of the corollary that does not follow imme¬ 
diately from the previous result is that the process (fu)u>o which results from a 
given choice w G Aug is terminating, and this follows once we show the second half 
of the result. 


^Strictly speaking, we should also include here the set of possible probability spaces, as in [30| ; 
for ease of presentation, we omit this detail, which does not affect our arguments. 
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To see the second half of the result, note that it is sufficient to show that there is 
a similar bound for the first time that Cu = 0 for some n G {0,1,..., N}. Consider 
the process at time u* ^ Ug, and suppose that the measure ^ has not already 
become singular at this time, so we have 

f (||w„|| + Au)dM = M* - M 

J Uo 

In particular, we can ensure that maxfc | drt| is arbitrarily large by choos¬ 

ing u* sufficiently large. Let k* be the maximising component; it follows immedi¬ 
ately from the fact that is a [0, l]-valued martingale with quadratic variation 
process {w^ du, that with high probability at least one component must have 
become zero or one. □ 


\du > u* — u — T. 


Between Lemma 3.1 and Corollary 3.6 we have shown that the problem (3.1) 
is equivalent to choosing the variable w in the problem above. Moreover, we can 
break the problem up into a sequence of independent problems by considering the 
process only up to the first time that one of the current atoms becomes zero. In 
particular, for ^ G 7^(X^), we can consider the problem: 


(3.9) 


Vjv(u,t,^,a) = sup 

wGA™ 


E 




+-F(At„)1{t„=t}|^t„ = = t , 

where cr = inf{s > u : ^ (0,1) some n G {0,..., N} or Tg = T}. We also have 

the boundary conditions Vo{u,t,^,a) = F{a -f (1 — t)x), where ^ = Sx- Here, the 
function Vk{u,t,^,a) is determined for ^ G 7^^(Xq,) with |q;| = fc -|- 1. Specifically, 
for ^ e 7^^(X„) with |q!| = fc -|- 1 


1 (cr. Ter, ) 1{T,, <r} 

+ F{At^)1{t^^t}\^t^ = o,,^u = ^,Tu = t , 

where we set a = inf{s > u : G T’^(Xq,) some a, |a| < k or = T}. Observe in 
particular that each ^ G V^{Xn) determines a unique set a such that ^ G V^{Xa') 
implies a' D a. Write a(^) for this unique subset, and we observe that we have the 
consistency conditions: 

= Vk{u,t,^,a), for all fc > |a(C)| - 1. 

Finally, fix ^ with |q!(^)| = fc -I- 1. We can make the identification between the 
probability measure ^ i^xi. (on X^), where io,ii, ■ ■ ■ ,ik are the ordered 

elements of a and the vector = (CA ■ ■ ■ j C‘) G •= G K+ ■ — !}• 

Specifically, = 1 — 1 • With this identification, we define: 

Va{u,t,C,a) = Vk{u,t,^,a)- 

We write x“ = {xig,Xi^,... ,Xi^), and = {z G : || 2 :|| = 1} for the unit sphere 
in Finally, note that for |q:| = 1, Va{t, a) = Vo{t, a) = F(a + (T — t)xig), and 
we then define (1 — 1 • := = 1- We afso use the convention S° := 0 and 

sup0 := —oo. 


Vk{u,t,^,a) = sup E 


Theorem 3.7. Suppose F{a) is continuous and non-negative. Fix a C {0,..., N}, 
with |q!| > 1, and write fc = |a| — 1. The function Va '■ K+ x [0,r] x x K_|_ —>■ K 
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is independent of u, and is the unique non-negative viscosity solution bounded by 
F{a + (T — t)xN), to 

(3.10) max(^^+x“ • (1 - 1 sup [tr(ww'^D|Va)] | = 0 

I ot da J 

for e and t <T, with the boundary conditions 

/oil', = F{a) 

where the second equation is taken when S dA^. Here a' is the subset of a 
corresponding to non-zero entries of [1 — 1 • and^°‘ is the vector identifying 

the corresponding probability measure. 


Proof. We work by induction; suppose the problem has been solved for k' < k, 
to give continuous value functions, independent of time. The case where k = 0 is 
trivial. The first step is to approximate by a problem with a finite time-horizon. 
To this end, we fix a sequence K oo, and consider the following two problems: 
For given (u, t, a) with ^ € P^(Xa) and |a| = A: -I- 1, we set = a A (K + 1) and 
define the functions V/f- and by 


yk{u,t,£,,a) = 


sup 

k.K 


E 


wgA: 


^F{a + (T - t)xN)ia^ - K)+) V (f(At ^ )1{t ^=t} 




-^T^K)l{r k<t} 


\Tu = t,^u = f, = a 


and 


V^(u,t,^,a) = sup E (F{a + {T - t)xN){K + 1 - a^)+) A (^F{At^j^)1 


weAS 


{T„k=T} 


+ Vk-i{cr^\T u = t,fu= = a , 

where = {(ws)sg[„ : prog. meas. with Wg S and ||w|| < l} and, 

as previously, cr = inf {s > u : ^ (0,1) some n S {0,..., k} or Tg = T}. With the 

same identification as above, we define and by 


Vaiu,t,C,a) = V\a(^)\_i{u,tA,a) and V^{u,t,C, a) = 


K 




_i A, of. 


Recall that the dynamics of the involved factors is governed by (2.2), 


and 

(3.8), with As given by (3.4). Note that without loss of generality, we may write 
= {(As,Ws)sg[„,a+i] : (As,Ws) prog. meas. with Wg G S'" and Ag G [0,1]}. 
It follows from Corollary V.3.1] that on the domain [0, iF -|- 1] x [0, T] x K'" x K, 
Vff and are both viscosity solutions to 
(3.12) 


dV^ 

du 


— sup 

wGS'^, 

AG[0,1] 


^(1 — A) tr(ww^iA|V)j) + A 


dV dV 


= 0 , 


equipped with the boundary conditions 


(3.13) 


r y„(u,r,e,a) 

1 Va{u,t,C°'',a) 


F{a) 

Va'{u,t,i°‘\a) 


for u < K, and either increasing to F{a + {T — t)xN) for u G [K, K Fl\ in the first 
case, or decreasing to 0 in [RT, iF -f 1] in the second case. In both cases, we have a 
viscosity equation with controls in a compact set, and with continuous boundary 
data on a compact domain. It follows that both equations have unique, continuous 
viscosity solutions, and the viscosity solutions to both equations correspond to the 
value functions of the corresponding optimal control problems. In particular, we 
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see immediately that (u,t,^,a) > Va{u,t,^,a) > ioi u < K 1. 

Moreover, from Lemma |3.H identifyin g U and Va-, we know the function Va is con¬ 


tinuous, and moreover, from Corollary 3.6 we know that (u, t,^,a) will decrease 


pointwise to Va{u,t,^,a) as K ^ oo, and V^{u,t,^,a) will increase pointwise to 
the same limit. We conclude that Va is a viscosity solution to the given equation 
(see e.g. Barles and Souganidis [^). 

Now suppose that W is another viscosity solution to the same equation, also non¬ 
negative and bounded by F{a + (T — t)xN). By the comparison principle, for any 
K, Va > V^{u,t,^,a), for u < K. Hence Va > W > Va', 

that is, W = Va- Finally, we observe that the solution Va is independent of u, by 
Lemma 3.1 so = 0, and optimising over A immediately gives the equivalent 
formulation. □ 


Remark 3.8. We note that some obvious generalisations of this setup can easily 
be handled. For example, consider Asian options with non-constant weighting, so 
At = f{t)St dt, for some (possibly signed) continuous function / : [0,r] —)• 
M. A simple modification to the arguments above gives the same result with the 
corresponding PDF: 


max 




(1-1-e 


8V 

t tax 

^ da' 


sup 


[tr(^ 




4. Examples and Superhedging 


In this section we consider some simple cases where explicit solutions to the 
viscosity equations in Theorem |3.7| can be given. We also give some arguments 
regarding the construction of superhedging strategies. A number of the results in 


this section can be compared to the recent work of Stebegg 45 , but we emphasise 


that our results extend beyond the case where F is convex, and we will consider 
such an example below. 

4.1. Convex payoff functions. 

Lemma 4.1. Suppose the function F is convex and Lipschitz. Then for all f € 

r\R+): 

U{t,^,a)= j F{a + {T — t)x) C(da:). 

Moreover, an optimal model is given by: 

So- = j a;^(da;) 


St = St, 


t > 0, 


where St 


In terms of the class of models considered in Corollary |3.6[ this corresponds to 
a model which takes = 0 until the measure and then setting A„ = I 

until T,, = T. 


Proof. By continuity, we are only required to check that (3.10) holds for atomic 
However, if we write C = / 2 ;^(da;), then 

^ = J F'{a + {T- t)x) (C - x) e(da:) 

< IF'{a+ {T- t)e) (? - x) e(dx) = 0. 

Moreover, ii t = T or ^ G 7^® then we have equality. 
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Recalling the notation of Theorem 3.7 we have 

U{t,^,a)= F{a + {T-t)x,^)e^ 

j=o 


and computing the second derivatives, we have Dill = 0. Hence (3.10) holds. □ 


In this convex case, we are easily able to provide a martingale inequality interpre¬ 
tation of this result. Indeed, this has already appeared in 45 . Since this will help 
our intuition, we provide an alternative approach to 45 , which will enable us to 


gain insight into the optimal strategies for the non-convex case. We restrict first to 
the case where F{a) = (a — K)^, for some K > 0, and we write Yt = At + {T — t)St. 
We suppose also that (St) is a continuous semi-martingale (although a pathwise 
analogue of this argument is possible, where St is assumed just to have continuous 
paths). From the definition of local time, we get: 


{At - K)+ = {Yt - K)+ = (Fq - K)+ + Mt + , 

where Mt is a local martingale, and is the local time of Y at the level K. It 
follows from the definition of Y, that we have: 

Lt^ = j {T-t) dif’-^*, where Kt = ^ , 

SO LS.k, 

is the local time of the asset price along the curve Kt- That is, we have 
a local time contribution coming from the crossing of the curve Kt by the asset 
price. However, for a given distribution of S'j', the local time at each value of x is 
fixed, and decreases as |a: — SqI increases. We now argue that is maximised 
by trying to accumulate all the local time on the curve Kt as close as possible to 
time zero: that is, all crossings of St = Kt should happen as close to time zero as 
possible. This happens since if St ^ Kt, then \St — Kt \ is increasing, and there will 
be less local time which can later be accumulated at Kt since the process needs 
to accumulate the local time at a (relatively) more distant point. In addition, the 
factor (T — t) which appears in the integral only makes the weight of local time 
accumulated at later times smaller. 

It follows (and again, this can be made rigorous) that the optimal model should 
make all crossings of Kt necessary to embed in a short time interval. After this 
time, it is irrelevant how the process behaves, so long as it either remains above or 
below Kt- 


Remark 4.2. The cases where there is a positive interest rate can be handled 
similarly (the process Yt = At + ^ — l) should be used instead). In addi¬ 

tion, by adding constraints, one can extend to general convex functions, with the 
model which crosses each relevant curve Kt corresponding to a convexity point of 
F immediately being the optimal choice. 


4.2. A non-convex example. In this section, we provide a solution to the prob¬ 
lem for a non-convex example. Specifically, we use the intuition from the convex 
case established above to try and find a solution to the problem for a payoff function 
of the form: 


(4.1) 


F{At) = {At — Ki)^ — {At — K2)+, 


Ki < K2 


Given certain additional assumptions on the measure we wish to embed, we will then 
verify that an optimal model may be determined through the use of Theorem |3.7[ 
We observe that the results of this paper simply verify the existence of an optimal 
model. Given the existence of an optimal model, the existence of a super-hedging 
strategy follows from general results (e.g. Dolinsky and Soner 25 ). 
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The intuition established above suggests that we wish to gain the benefit of the 
convexity at Ki immediately, while leaving the concavity at K 2 until as late as 
possible. However there is a trade-off, since the process may sacrifice some of the 
convexity at Ki by waiting at K 2 . To specify this, note that since the payoff is 
constant for At > K 2 , it must be suboptimal to have positive support on events 
for which Yj > K 2 , t G (0,T]. Indeed, the payoff will not be improved by this but 
the martingale property of V implies that more mass must then be put on events 
yielding an average strictly less than In consequence, at least for some values of 
Ki, K 2 , it is natural to conjecture that at time 0, S will either run to K 2 , or to some 
lower level; at the lower level, the paths will behave as indicated by Lemma [4.1| 
For a measure /i with continuous support, we therefore define the level 77 by 


(4.2) 


rj := inf 


X G 


x^{dx) > K 2 


We then expect to accumulate all mass above x = r] into a branch of the underlying 
taking the value St = K 2 , t G (0, T), and embedding lx>r)^J‘{dx) at t = T. As for the 
mass to be embedded on [ 0 , 77 ), we expect the same optimal behaviour as detected 
for the convex case in Section 




4.1 


Put differently, at 77 = 


-1 

0 


with probability 

’ ^{dx) we expect the measure-valued martingale to take the value 

1 <-» i-> y-1 t-w-r*1 -w r\ 1 -w t 1 1 4—t r 


and with probability Jp fi{dx) we expect to recover 


^ ^ ... . .. /i([77,oo)) 

and stay constant until 
the structure of Lemma 14.11 

To specify this, we restrict to a certain class of measures /r. Specifically, we 
consider the problem at time t G [0, T] with current average At = a when we take 
x“ = (—1,0,1), so |a| = 3, and consider the terminal distribution 


(4.3) 


= {1 - 13 - j)S-i + l35o + jSi, /3,7G(0, 1 ). 


That is, = il3,j), and we write V{t,a; I3,j) = Further, we let 

Ki G (—1,1) and K 2 G (0,1). Suppose now that «+ ^(^(T”^) < K 2 < a + {T — t). 
That is, the expected averages considering the mass at both cc = 0 and a: = 1, 
and at a: = 1 only, are, respectively, smaller and greater than 7^2. Following the 
reasoning above, at 77 = , we then expect to have split into the two measures: 


(4.4) 


C = 


ySp 

V 


■jSi 

■7 


and 


2 _ (1 - 7 - /3)(5-i + {/3- r])So 
1 - 7-77 


where (c.f. (4.2)) 77 is given by 


77 = sup < 7/ : 


7 


7 + 2/ 


> 


K 2 - a 
T-t 


= 7 


T-t 

K2-a 


- 1 


If a < iFi, this yields V{t, a; (3, 7 ) = (7 + fj)(,K 2 — Ki)- However, if a — (T — t) < 
Ki < a, the result for the convex case guides us to further split the measure 
into 5_i and Jq (equivalently, all mass at a; = 0 and a: = 1 might be accumulated 
in one measure; see further discussion below) and it follows that V{t, a; 13, 7 ) = 
(7 + v){K 2 — Ki) + {13 — fj){a — Ki). Similar considerations for the other cases 
guides us to define the following candidate value function: 

(4.5) 

' K 2 -K 1 

i2-f + P-l){T-t) + a-Ki 
^^{K2-K,) 

T-t 


V{t,a;/3,j) := < 


liT-t) - {'y + /3){Ki - a) 
1^SK2-K,) 
l{T-t- {Ki - a)) 

0 


(i) 

(ii) 

(hi) 

(iv) 

(v) 

(vi) 

(vii) 


,-101 


K 2 < a 

Ki < < K2 

0-1 < <K2< a°i 

0-1 <Ki< o°,o°i < K 2 
o® < ATijoii < K2 < oi 
o® < ATi < oi < K2 
oi < Ki 
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where we used the notation a* = a + s^{T — t), with s'^ = i, i G {—1, 0,1}, 

(with the convention = K 2 when 7 + /3 = 0) and = s = 27 + /3 — 1 — i.e. 
the expected average taking the mass at various atoms into account. The function 
is depicted in Figure [T] together with a candidate sample path. 



Figure 1. The value function graphed as a function of /3, 7. Here 
t = a = 0,T = 1, Ki = —0.1,K2 — 0.5. Also shown (in red) is 
a possible path of {^r), starting from (/3,7) = (7, |). The planar 
regions in the diagram correspond to the regions (i), (iii) and (iv) 
given in (4.5). The process starts in region (iii), and runs until it 
hits the boundary of this region and region (i). The continuing path 
is then shown running along the boundary of regions (i) and (iii). In 
an optimal model, this behaviour happens at time 0, although note 
that there are many possible solutions: for example, the movement 
along the boundary between regions (i) and (iii) could happen at 
an time between t = 0 and t = T. On reaching the point 
the process is unable to move any further before the time t = T 
without being suboptimal. 


Example 4.3. Observe that sending K 2 -G 00, V(t,a] reduces to the value- 
function for the (convex) payoff function F{At) = {At — Ki)+ given in Section 


|4.l| with fj, given by (4.3). Alternatively, let /3 = 0, 
V {t, a; 7) reduces in the following way: 


Ki = 0 and K 2 = 7. Then, 


(4.6) 


H(t,a;7) 


1 

2 ’ 


7 


1 


1+ 


1/2-a j 
T-t 


27 - 1 > 
27 - 1 < 


l/2-a 

T-t 

1/2-a 

T-t 


Note that for t = a = 0 and 27 — 1 < 1/2, V* + sVa = 0 if and only if 7 = 0 or 
7 = 3/4. Therefore the optimal model (St) will jump to either —1 or 1/2 at time 
t = 0, and then stay constant until time t = T where p is embedded. 


It can be verified by elementary calculations that V (t, o; /3,7) is continuous. The 
next result verifies that it is a (bounded) solution to equation (3.10) with F{a) and 
pL given by (4.1) and (4.3). Hence, according to Theorem |3.7[ V{t,a; is indeed 
the associated value function. 
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Proposition 4.4. The function V(t,a; defined in (4.51 is the unique non¬ 
negative viscosity solution bounded by K 2 — K 1 , to the equation (3.10) equipped with 
the boundary condition (3.11). In particular, it is the value-function associated with 
the payoff (4.1) and g, given by (4.3). 

Proof. Differentiating V(t, a; j3, 7) within the respective regions, we obtain that 
Vt(t,a] /3,j) + sVa{t,a; (3,^) = 0 in regions (i), (ii) and (vii), and 


(K2-Ki)(2j+g) 

K2-a\-i 


K2-a 

fT-ty 


27+/3- 

T-t 




(4.7) Vt + sVa = 


- 7 + (27 + /3- 1)(7 + /3) 

(^K2-a-i2y + p-l){T-t 
- 1 


K2-Kr 

3 (K2-ay 


27 (. 


7 + 


in (iii) 
in (iv) 
in (v) 
in (vi) 


where s = x“ • (1 —/3 —7, /3,7) = 27-I-/3 — 1. Using that 27-I-/3 — 1 < regions 

(iii) and (v), and that 7 + ^5 < 1 implies (27 + /3 — l)^'^ < 1 for 7 > 0, it follows 
from (4.7) that Vj + sVa < 0 within the respective regions. In consequence, with 
Uv denoting the directional derivative in the direction of v, it holds on {7 +/I < 1} 
that 


(4.8) 


Uv < 0, 


= (1, 27 + /3-1,0,0). 


Recall that U(t, a;^) is a viscosity super (resp. sub) solution to (3.10) if for each 
if G and at each point (t, d,^) minimizing (resp. maximizing) V — ip, 


(4.9) 


max 


+ (27 + ^ - l)pa, ^ sup [tr (ww'^D|(/?)] 1 < 0, (resp. > 0). 
^ weS2 J 


We first argue that U is a sub solution. To this end, let p G ^14.2.2 ^ = 

(t, d, /3,7) such that z maximises V — p. Note that there exists w G such 
that the directional derivative at z along w (keeping a and t constant) satisfies 
Uww = 0; if z G {27 -\- p — 1 = ^rfZt } or z G {7^ = let w in the direction 


of that line. Since {V — i^)ww < 0, it follows that tr > 0 (note that 

tr (ww'^I?|(^) = 


^D'^p w = p„ 


(4.10) 


sup [tr 


In consequence, 
(ww"'’Zl|(^)] > 0. 


In order to show that U is a super solution, let p G and let z = {i,d,P,p) 

be a minimiser to V — p. Due to the concavity of U as a function of /3 and 7 
and the differentiability of p, z must lie strictly within one of the regions given in 
(4.7). Hence, for all w G S^, tr (ww"'’Zl|(U — (/?)) > 0 and tr (ww"'’Zl|U) = 0. In 
consequence, 

■^D^p)] < 0. 


(4.11) 


sup [t 

wGS^ 


r ww 


Let now v = (1, 27 + /3 — 1, 0,0). Since z minimises V — p,it follows that p^ < Vv 
According to (4.8), we thus obtain 

Tt + (27 -G P — 1) Pa Z Uv < 0, 


which combined with (4.11) renders (4.9). 

It remains to argue the boundary conditions (3.11). Note that for t = T, the 
only possible regions are (i), (ii) and (vii) (for a > K 2 , a G [ATi, 7^2) and a < Ki) 
for which V{t, a; P,y) is given, respectively, by 7^2 — ATi, a — Ki and 0. Hence, the 
terminal condition is satisfied. Next, note that for 7 = 0 and P = 0oij-\-P = l, 
the problem reduces, respectively, to the convex case and the case presented in 
Example |4.3| This verifies the second boundary condition and we conclude. □ 
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We now discuss the optimal control associated with the value function (4.5) and 


the corresponding solution to the basic optimisation problem, Problem |2.8| Indeed 
recall Lemma [2.11[ which says that if Problem 2. 10| admits an optimal solution, this 
solution corresponds to a solution of Problem |2.8[ Naturally, the solution coincides 
with the conjectured optimal model used to deduce the form of V (t, a; (3, 7 ). How¬ 
ever, our aim below is to illustrate how it may be deduced directly from the value 
function and, in consequence, from the dynamic programming equation 
(3.13) and to show that it is non-unique, and also non-trivial since it necessarily 
has a jump t = T as well as t = 0. We let T = 1 and split the behaviour into three 
parts. 

(I) Real time is kept constant while the measure-valued martingale evolves (Xu = 
0 and {/3u,ju) diffuses): {St) jumps to certain points at time t = 0 . 

Depending on the parameters of the problem, the starting point (0,0; (3, 7 ) lies 
in one of the regions (i), (hi), (iv) or (v). It follows from the DPP equation (3.10), 


that the model can evolve in real time only if Vt-\-sVa = 0. According to (4.7), while 


Vt -\- sVa = 0 for all (/ 3 , 7 ) in regions (i), (ii) and (vii), it holds for the remaining 
regions that Vt + sl 4 = 0 , if and only if. 


(4.12) 


27 + /3-1 = ^ or (/ 3 , 7 ) = (0,0) in (hi) 
7 -P^ = l or (^, 7 ) = ( 0 , 0 ) 


27 - 


( 3-1 = ^ 


or 7 = 0 


7 = 0 or (/ 3 , 7 ) = (0, 1) 


in (iv) 
in (v) 
in 


(vi) 


In consequence, if starting in region (i), one may immediately evolve in (real) time. 
However, if starting in regions (iii)-(v), (real) time must be kept constant while 
(/3u,7u) are allowed to diffuse until the above boundaries are reached: that is. 


A„ = 0 until the measure-valued martingale C = (/3-,7-) satishes (4.12). Note that 

K^-a 
T-t > 


since H( 0 , 0 ;/3„, 7 „) is a martingale, if at the line 


diffusion will take 

place only in the direction of that line and the region remains the same until the 
boundaries are reached. This implies that the associated price process {St) jumps 
to certain points at time t = 0 . 

(H) Progress in real time only (Xu = 1): {St) is kept constant. 

Once the measure-valued martingale satisfies (4.12), (real) time might start to 
evolve (A„ >0). In particular, one might let A„ = 1 which implies that {St) is kept 
constant. On a case by case basis, it can be verified that once at a point where 
Vt + sVa = 0, this remains the case. For example, consider sitting at {7 -I- /3 = 1} 
in region (iv). With a slight abuse of notation, we see that at this line 


D f = 27 4-/3 —1 > 0 , i = 0 

(4.13) -^{t) = \ I {a{t) - (T - t)) = 27 + ^ > 0, i = -l 

I + = 0, * = 1 


Hence, when evolving in (real) time, and with no change in ( 3 , 7 , a~^{t) = a{t) — 
{T—t) may move above Ki and so the point {t, a{t)-, (3, 7 ) moves to region (ii). Since 
Vt + sVa = 0 within region (ii), the claim holds for this case. Similar arguments 
apply to the other cases. 

(HI) When Tu = T, the measure-valued martingale = {(3., 7 .) terminates: {St) 
jumps and embeds p, at t = T. 

Real time may run until T„ = T. Thereafter A^ = 0 and {(3, 7 ) diffuses until 
terminates; that is, until = {(3., 7 .) reaches (0,0), (0,1) or (1,0). As expected, 
V{T,a{T)-,j 3 , 7 ) stays constant during this process as it is independent of 7 and (3. 
This step corresponds to S embedding n via a jump at t = T. 

The evolution in time and measure described in (II) and (HI) could, partially, 
happen simultaneously or in the reverse order. This implies that the optimal model 
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is not unique. For example, having reached the line 7 + /3 = 1 in region (iv), 
one might let {Pu,"fu) continue to diffuse until reaching either of the points (1 — 
^T-u" ’ ^T-u '" ) 0), before letting (real) time evolve. This corresponds to the 

behaviour used to deduce V{t,a; /3,j): i.e. (St) jumps to one of the values —1, 
0 or K 2 at time t = 0. Alternatively, by letting A„ G (0,1), and supposing that 
At > Ki, one may let time and measure evolve simultaneously, which corresponds 
to Yu = Au + {T — u)Su, u G (t, 1) being either constantly equal to —1 or moving 
(as a continuous martingale) between the values 0 and K 2 . Observe that this 
behaviour may result in a different distribution to At in comparison with the case 
where all the diffusion happens immediately. Similar behaviour can be observed in 
the regions (i), (ii) and (v), although the distribution of F{At) then remains the 
same. 

While the optimal model is not unique, we note that it has certain characteristics: 
the model necessarily has a jump at both < = 0 and t = T. Indeed, there is a certain 
amount of mass which is ‘locked in’, and cannot be embedded until the terminal 
time t = T. This to ensure that St = K 2 , t £ (0, T), with a certain probability (e.g. 
in regions (iii) and (v)). On the other hand, sending K 2 —>■ 00 and thus isolating the 
behaviour at the convex kink ATi, we see that the mass terminating above/below 
Ki must already at time t = 0 be accumulated above/below Ki. Although affected 
by the presence of A' 2 , this feature is present also for the general case (e.g. regions 
(iv) and (vi)). 


5. Conclusions and Further Work 


In this paper we have considered the model-independent pricing problem for 
Asian options using a novel approach based on measure-valued martingales. While 
this paper concentrated on the case of Asian options, the main ideas should gen¬ 
eralise to other cases, and may provide insights beyond the existing literature. 
Moreover, there are a number of natural questions which arise from our work: 










Is it possible to generalise the results in this paper to the case of a general 
starting law? Financially, this has the interpretation of pricing a forward 
starting option at time 0, where 0 < Tq < T), the option pays the holder 
the amount F (Su duj at time Ti and the price of call options are 


known at times Tq and Ti. Write A for the implied law of Sxg and for the 
implied law of Sti ■ It follows immediately from the results of this paper 
that the problem is equivalent to finding a function m : M. ^ ,x i-G 
nix which maximises f I7(0,mx,0) A(dx) over all functions m such that 
f mx(A) A(da:) = /r(A), for all Borel sets A and x = f ymx(dy). However, 
it would be interesting to have a dynamic formulation of the problem, 
similar to the simple case. 

The PDF (3.10) is formulated for the case of atomic measures. Is there 
a similar formulation that holds when ^ is only assumed to be measure¬ 
valued? 

What is the corresponding formulation for (3.10) in the case of (say) op¬ 
tions on variance? 

Do the methods described above extend to related problems in higher 
dimensions? If the formulation is given in terms of a measure /r on K'^, 
one might hope that a very similar approach would be possible. Is this 
also true of (the financially more meaningful) case where St G and the 
marginal distribution of each component of St is specified? 
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Appendix A. A Formal Dynamic Programming Principle 


In this section, we formally derive a Dynamic Programming Principle (DPP) for 
the pricing problem in its weak form given in Definition |2.10[ We note that our 
previous results do not make use of this DPP, but we believe that this result is of 
independent interest. We choose to follow closely the setup in 46 ; see however also 
[2^ and 44 for similar arguments. 


We denote by D the set of cadlag paths on [0, 00) taking values in E := P(IR) x 
[0, T] X K, where we equip P(K) with the topology induced by the Wi-metric and E 
with the product topology; in particular, this renders E a Polish space, and using 
the Skorokhod topology on D it is a Polish space too. For cc, x' G E, we write 
d{x,x') := V |t — t'\ V |a — a'\. A generic path in D is denoted by uj and 

we use X = {^,T,A) for the co-ordinate process: Xr{uj) = Ar){u}) = uj{r). 

The set of all probability measures on ;B(D) is denoted by ip. A map 1 / : E x 
;B(D) —>■ [0,1] is called a (universally) measurable kernel if i) v{x,-) G fp for all 
X G E, and ii) A 9 x —>■ v{x,A) is (universally) measurable for all A G /B(D); recall 
that the universal a-algebra is the intersection of the completions of the Borel a- 
algebra over all probability measures on the space, and that universally measurable 
functions are integrable with respect to any such probability measure. We write 
Vx for the probability measure i'{x, ■) and interpret v as & (universally) measurable 
map A —)■ tp. 

A Borel-measurable map from D to [0, 00) is called a random time. For any 
random time r, we define the shift-operator 6^ on D by Xr{9r{uj)) = Xx(uj}+r{^)- 
Further, for a random time r and any two paths S D such that Xr{u}) = 

Xq{uj'), the concatenation oj uj' is an element of D specified by 

Xt{uj uj') = 'i-{t<Tiuj)}Xtiuj) + l{t>T{uj)}Xt-T{uj){uj'). 

For a random time r, a probability measure /i G ip and a universally measurable 
kernel u, we then define the concatenation /i 1/ as the probability measure in fp 
given by 

{^*xiu){A)= JJ 1a{uj oj')vx^(^uj)iduj')^{duj), Ae;B(D). 

We let F° = denote the filtration generated by the co-ordinate 

process X, and let F = {.7>}rG[o,oo) be its right-continuous hull; i.e. = ns>r-F'g, 
for r > 0. For x = (^, t, a) G E, we denote by Vx the set of probability measures in 
tp for which: 

(i) Xq = x a.s., 

(ii) is a measure-valued F-martingale, 

(iii) Tr is non-decreasing with limr_>.oo = 00 , a.s., 

(iv) Ar = a+ 5„_dT„ a.s., where C = f a;^.(da;) and tq = inf{r : Tr > T}. 
Finally, we note that according to Lemma 3.12 in [46| , there exists a measurable 
functional A = (^, T, A) : D —>■ A such that X(uj) = limt_).oo At(w) whenever the 
limit exists and X(dt(uj)) = X(uj) for all t > 0. We let G(oj) := F(A(uj)); for any 
fj. G Vx, X G E, we then have that G = limt_).oo F{At) a.s. We define the problem: 


(A.l) 


v{x) = sup E'" [G]. 


Lemma A.l. The value function defined in (A.l) coincides with the value function 


given in (3.11. In particular, x 1—>■ v(x) is continuous. 


Proof. Let (11, (t/^),F, (Cr-)) (i^r)) be a multiple as specified in Problem 2.10 


Without loss of generality, let x = (/r, 0,0) (Note that in Problem 2.10 


it was 


assumed only that ^0- = x; by considering a time transformation t 
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this difference can be seen to be irrelevant). Since any martingale is a martingale 
in its own filtration, it follows that any such multiple induces on the canonical 
space D a measure G Vx- Conversely, any probability measures fi G Vx together 
with the space (D,,B(D),F) and the canonical process (^,T) produces such a mul¬ 
tiple. Indeed, the fact that one may, without loss of generality, assume that T. 
is absolutely continuous a.s. follows as in the proof of Lemma Moreover, 

since tq < oo a.s., for any pair {Tr,^r) with ^ a measure-valued martingale, one 
may construct a terminating measure valued martingale such that (Tr,^r) yields 
the same value of the payoff. The continuity is then an immediate consequence of 
Lemma 13.11 □ 


Remark A.2. Let F° be the filtration generated by (T, ^), where C = f x^ (dx), 
and let F be its right-continuous hull. Further, let Vx denote the set of measures 
in tp which satisfy properties (i) to (iv) above with the difference that ^ is only 


assumed to be a measure-valued F-martingale. We then have that 


(A.2) 


v(x) = sup [G], 


Indeed, this follows from the proof of Lemma 2.11 where the constructed measure¬ 
valued martingales are indeed adapted to the filtration generated by T. and . 

We are now ready to state the DPP. For simplicity we provide it here for bounded 
payoff functions. We denote by T the set of finite F-stopping times. 


Theorem A.S. Let F : K+ -G K+ he bounded and Lipschitz. Then, for all x G E 
and T G T, 

v{x) = sup E [i;(Ai-)]. 


Proof. Given e > 0, x G E and t gT, take fi GVx such that v{x) — e < E^[G]. Let 
Pa, be the regular conditional probability distribution of 0r under p, given Xx = x; 
since D is Polish it exists p, o A“^-a.s. Recall that for any / G Gb(IR_|_), C (/) 
is a bounded /r-martingale. By use of the same argument as given in the proof 
of Proposition 3.11 in 46 , we may then conclude that ^. (/) is a Ua;-martingale 
for p o A“^-almost all x G E. It follows that Vx G Vx for p o A“^-almost all 
X G E. As argued in the proof of Proposition 2.5 in [46| , we may further pick a 
universally measurable version of Vx such that Vx G Vx for all x G E. Now, note 
that G(a;') = G(a; w') for all w,a;' G D with A^(w) = Xq{uj'), and thus 


jj G(w')ux,(a;)(da;')ir(da;) 


G{uj *r uj')iyxx{u>){duj')p{duj). 


By use of the properties of the r.c.p.d., we thus obtain the following line of equalities: 

E^[G] =E'"[Go6»^] =E'"*"''[G] = Ef^[g{Xx)], 

where g{x) = E'^‘“[G] = / G{oj')h'x{duj'). Hence, v{x) — e < E^[u(At-)] for some 
p G Vx, and since e was chosen arbitrarily we obtain v{x) < sup^gp^ E [v(At-)]. 

In order to argue the reverse inequality, for any e > 0, we first argue the existence 
of a measurable kernel v with Vx G Vx and E'^^' [G] > v{x) — e, for each x G E. To 
this end, we define a mapping E x 3 B {x,ui) t-G- G D such that for each 

X = G E, the mapping Co = >->■ a^{Co) := oo^’^ = modifies 

the path ui as follows: 

^r-ldy) = /f,.(da:)TO?(a;,dy), 

t’p — t ^Q, 

= “ + / x^u-{dx)dtu, 


(A.S) 


a. 
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where the family m^{-,dy) satisfies Wi(Co)C) = If ~ yl^o(dx)m^(x,dy), and 
To = inf{r : tr = T}. Then (a;,w) is B(E) x ;B(D) measurable. Hence, for 

any P S U^^e'Px, defining Pa, := P o x £ E, yields a measurable kernel D 

with € Vx- Indeed, the martingale property of ^ under Dx follows as in the proof 
of Lemma 3.1 Further, from ( A.3| ) we have that 

fT r !■ 


K^OO 


< 


/to 

+ 


a;^j-i(da;)— / a;^j-i(dx) 


ds 


' T—{t—to) 


x^j-i(dx) + / x^(-i(dx) ) ds + |ao — 


Proceeding as in the proof of Lemma 3.1 for any P S P® with x = (^, t, a), we then 
have that 

E^[|Aoo(w) - Aoo(a'* o w)|] < r yV’i(^,^) + (t - t) j x^{dx)\/ j x^{dx) + \d - a\, 

and with Vx = P o (a^) “ ^, the Lipschitz property of E thus yields | E®* [G] — E'^“’ [G] | < 
6^{d{x, x)) for some modulus of continuity S^. Now, let £ > 0, and let {a;"}„eN be a 
countable dense subset of E. For each n, let P„ G Vx^ such that E'’”" [G] > u(x”) — |. 
Further, for each x", let r„ such that for all x G := {x G G : d(x,x”) < r„}, 
it holds that v(x^) > v(x) — | and |E’^"[G] — E‘"?[G]| < | with i/” := P„ o 
the existence of such r„, n G N, follows from the above and Lemma [3.1[ We then 
define the measurable kernel (i^x)xge by 

(A.4) z/j; := ^ lc"(x) P„ o 


where G” = 5" \ IJ 


nGN 


fc-1 


By construction, for x G G", n G N, we then have that 

E-^x [G] > E’P’" [G] - > v(x^) -^e> v{x) - e. 

O O 

Hence, i/ is a measurable kernel with iXx G Vx and E^“= [G] > v{x) — e, for x G E. 

In order to conclude, we take xq G E, y G Vxq, t G T and ix as constructed 
in (A.4). Since Cif) is a bounded /r-martingale for any / G G6(]R+), we may use 
the same arguments as in the proof of Proposition 3.10 in to deduce that Cif) 
is also a fi ;z-martingale. We may thus conclude that /x t/ G Vxg ■ Letting 
g{x) = E'^“=[G] and noticing that g is measurable, we thus obtain 

u(xo) > E'^^^'iG] = E'^[5(A,)] > E'^[x;(A,)] - £. 

Since e and g, G Vxo were both chosen arbitrarily, we obtain v{x) > sup^gp^ E [x;(At-)] 
and conclude. □ 

The above proof exploits the continuity properties of our problem in order to 
construct an approximately optimal measurable kernel; see and 13 for similar 
approaches. 
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